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Abstract. Birth— and— death models are now a common mathematical tool to 
describe branching patterns observed in real— world phylogenetic trees. Liggett 
and Schinazi (2009) is one such example. The authors propose a simple birth- 
and— death model that is compatible with phylogenetic trees of both influenza 
and HIV, depending on the birth rate parameter. An interesting special case of 
this model is the critical case where the birth rate equals the death rate. This 
is a non— trivial situation and to study its asymptotic behaviour we employed 
the Laplace transform. With this we correct the proof of Liggett and Schinazi 
(2009) in the critical case. 
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C^ ' 1. Introduction 

Different viral types have phylogenetic trees exhibiting different branching prop- 
erties, with influenza and HIV being two extreme examples. In the influenza tree 
a single type dominates for a long time with other types dying out quickly until 
K^ ■ suddenly a new type completely takes over and the old type dies out. The HIV 

^^ I phylogeny is the complete opposite, with a large number of co-existing types. 

^^ • In [^ a stochastic model is described and depending on the choice of parameters 

f~^ I it can exhibit both types of dynamics. We briefly describe the model after [6]. We 



only keep track of the number of different viral types at each time point t. Let 
f~^ I N{t) denote the number of distinct viral types at time t. In the nomenclature of 

fT^ ■ phylogenetics N{t) counts the number of different species alive at time t. At each 

time point the birth rate is XN{t) and the death rate is N{t). If there is only one 
type alive then it cannot die. Clearly N{t) is a Markov chain with discrete state 
space and continuous time. Each virus type is described by a fitness value that is 
randomly chosen at its birth. If a death event occurs the type with smallest fitness 
^ I dies. This means that only the fitness ranks matter and so the exact distribution 

■ ■ ■ of a virus' fitness will not play a role. 

The main result of [6J is the asymptotic behaviour of the dominating type, 
whether it is expected to remain the same for long stretches of time or change 
often. This is summarized in Theorem 1 [6], 

Theorem 1.1 (Theorem 1, [6 J. Take a G (0, 1). // A < 1 then 

lim P [maximal types at times at and t are the same) = a, 

while if X > I then this limit is 0. 

The proof of this theorem is based on considering successive visits to the state 1, 
in particular denote ti, T2, . . . to be the (random) times between visits of the chain 
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to N{t) = 1 and r„ := n + . . . + t„. In j6] the latter random variable is represented 
as 

n n 

Tn — 2_^ ^i + 2^ -^i' 

i=l 1=1 

where X^ are independent mean 1 exponential random variables and Hi are the 
(independent) hitting times of the state 1 conditional on starting in state 2. De- 
scriptively Hi is the Jth return from state 2 to the state of one virus type alive, since 
from 1 the chain has to jump to two types. The Markovian nature of the process 
ensures that the HiS are independent and identically distributed for distinct is. In 
the proof of Theorem 1 it is stated that the cumulative distribution function of iJ^, 
F(i), satisfies, 



(1-1) / . , I o d^-^^ 



F(t) 

I + ,s2 - 2s ' 



t 



solved uniquely for, 

(1.2) F{t) 
^ ' ^ ' 1 + t 

This gives the asymptotic behaviour (Lemma 3 Ji,), 

Tn 

(1.3) lim — - — - — - = 1 in probability, 

ri^oo nlog(n) 

from which the result of Theorem 1 is derived when A = 1 . 

However Eq. (|l.ip does not take into account that this model differs from a 
classical birth-death model where is the absorbing state. We illustrate this in 
Fig. [TJ We can easily re-numerate the state values, but the intensity values will 
differ between the two models. Correcting for this difference in intensity values one 
will still get the same asymptotics as in Eq. ()1.3p and hence the same result as in 
Theorem 1 but with a more complicated proof. Below we present a correct proof 
of Lemma 3 IT in the case of A = 1, based on Lemmas 12.11 and 12.21 From this the 
statement of Theorem 1 of [6] follows. 

2. Auxiliary lemmas 

Lemma 2.1. Let Pi{t) = P{Hi < t) = F{t). Then Pi{t) solves the renewal 
equation 



'■"^ ^■'"^irfip + rir 



7Piiy)dy. 



it-y)r 

Proof. In panel A of Fig. [l]we can see a representation of the studied Markov chain 

on the state space S = 1,2, 3, Due to the HiS being independent for different 

is we can study the distribution of Hi and treat 1 as an absorbing state. Let P„ (t) 
denote the probability of being in state n at time t, when one starts in state 2 at 
time 0. The system of differential equations describing the probabilities is, 

Piit) ^2P,{t) 
(2.2) { P^it) =-4F2(t) + 3P3(i) 

P^(i) = -2nPn{t) + (n + l)Pn+iit) + (n - l)-P„-i(i), n>3 
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Figure 1. A: Depiction of the Markov chain model described in 
[B]. B: Depiction of a classical Markov chain model for which Eq. 
p. II) would be correct. The numbers inside the circles are the 
states (counting the number of virus type) and the numbers above 
and below the arrows are the birth and death rates respectively of 
the state from which they come out. 



with initial conditions, 

(2.3) Fi(0)=0, F2(0)-l, F3(0) = ... = P„(0) = ...=0. 

Let P{s,t) denote the generating function of the sequence {Pn{t)}^^i, i.e. 



P(s,t) = ^P„(i)s". 

n=l 

Taking first derivatives we get the partial differential equation 

(2.4) 



^^(^'^)^-(.-l)^P,(0 + (.s-l)^'^(^'^) 



dt 



ds 



with initial conditions 
dP{s,t) 



(2.5) 



ds 



.;o^Pi(t) P{s,t) |.^.= ^P„(0)s" = ,s2. 



Following §2.1 [2 and using the substitution z{x) = P{h{s,x),t + x) with 

1 



h{s, x) :— I + 



s^l, 



s-l 



we arrive at, 



P{s,t)-( -1 1; \M = z(0)-z'(-t) = f z\x)dx^ f i -^Pi{v)Av. 

\l-t{s~l)) J_, J, ^y_t+^) 

Evaluating the derivative of both sides with respect to s at we find that the 
function Pi{t) must satisfy the following integral equation (we can recognize it as 
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a renewal equation), 

(2.6) g(0,t) = P.W-^ + { (,^(;_^)). P.(,)d,. 



D 



Lemma 2.2. i^(i) — Pi{t), the solution of the renewal equation 112.1]) . has the 
following properties, 

(2.7) / tF'{t)dt - log(/i) as /i ^ oo, 

Jq 



(2.8) 



/ t^F'{t)At ^ h as h ^ oo. 
Jo 



Proof. The proof is based on Tauberian theory and we refer the reader to [31 13] for 
details on this. Another approach would be to study the asymptotic behaviour of 
F'{t) by renewal theory results (see e.g. [IlIS]). The Laplace transform of a density 
function /(a;), denoted f{x){s) is, 

/■oo 

mis) = / e~^^f{x)dx. 
Jo 

We will use the following theorem from [3], Theorem 2 §XIIL5, 

Theorem 2.3 (Theorem 2 §XIIL5, 3 ). // L is slowly varying at infinity and 
< p < oo, then each of the relations 

(2.9) mis)= f fit)cM-st)dt^s-PLi-), s^O, 

Jo s 

(2.10) F{t) = ^ f{u)du ^ ^,J-^tPL{t), t -> oo 

implies the other. 

F{t) = Pi{t) defined as the solution to the renewal equation (|2.ip after differen- 
tiating will satisfy 

(2-11) F'{t) = rr + / 7 ^F'{y)dy 

We calculate the Laplace transforms of F'{t), tF'{t) and t^F'{t), 
(2.12) 

nr)(.)=(f^(.)+(^(.).nr)(.) 

(2.13) 

VT7f(^)+(rHF(^)-^^^)(^)+(TTt) 

(2.14) 

t'F'{t)is) = \^r^is) + (i^(^) • nt)is) + (rT7)^('^) • tnm + (Ttt^ (^) • t^p'^s). 



tF'{t){s) = V^(*) + TTtHw^^) • ^'W(^) + TTT^i') ■ tF'it)is) 
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We are interested in the behaviour of the transforms as s -> 0, and for this we will 
use the well known property (verifiable by the de L 'Hospital rule) of the exponential 
integral, 

/ du log(s), 

Js u 

to arrive at, 

^#(^) + nw(^)-^)(^) 1 

(2.15) tF'{t){s) ^ ^'+*^ ^!1!L ^^^-log(-), s^O 

(2.16) 

(2.17) 

.(i^(t))(.) . S?(-^ - ^(-^ tS;(-) ■ (^^^))(-) , 1 _o. 

1 ^ IT+tF*^'*-' 
As both the constant function (s*^) and log(l/s) are slowly varying functions for 
s — )■ the Tauberian theorem allows us to conclude that, 

(2.18) f tF'(i)di - log(/i) 

Jo 

(2.19) / t^F'{t)dt - h. 

Jo 

D 

3. Proof of Lemma 3 [6] 

We will now use Lemmas l2.ll andl 2.2l to prove Lemma 3 from [6 . Define as there, 
rPn rPn 

rrin := / tF'{t)dt and s„ := / t^F'{t)dt, 
Jo Jo 

with pn '■= n-y/log(n). By Lemma 12.21 we know that, 

m„ ~ log((0„) -- log(n) and s„ ~ /9„. 

We now need to check how n(l — F(p„)) behaves asymptotically. We do not know 
what F{pn) is but using the Tauberian theorem and Eq. p.l7p from the proof of 
Lemma [2^ we get that, 

(3.1) / u(l- F{u))dur^t, t^oo. 

Jo 

Therefore using integration by parts and Eq. (|2.19p , 

j;;uil^Fiu))du _ i^il -F{t)) !liF'{u)du 
t ~ t ^ t ' 

lJ{t{l-F{t))) + ]-, t^oo, 
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and so we arrive at 

(3.2) lim i(l - F{t)) -^ 1. 

t—>oc 

The rest of the proof is a direct repeat of the one in [6] and so we get (as in [6l) 
that Tn/{n\og{n)) tends to 1 in probabihty implying Theorem 1 [61 for A = 1. 

If we apphed the same chain of reasoning to the model of panel B in Fig. [1] 
starting off with the system of differential equations, the analogue of Eq. (12. 4p 
would be a homogeneous partial differential equation, 

with initial conditions 

(3.4) ^^fll) |..„^p^(t) P(,,i)|_=,, 

in agreement with Pi(t) = t/(t + 1). It would therefore be an interesting problem 
to see what conditions are necessary on the nonhomogeneous part of Eq. (|2.4p to 
still get the same asymptotic behaviour of the Markov chain and what underlying 
model properties do these conditions imply. 
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